Modeling Varying Pauses to Develo for Recognizing Noisy Conve
نویسنده
چکیده
The frequent appearances and varying acoustics of pauses in noisy conversational speech make it a problem to automatically generate an accurate phonetic transcription of the training data for developing robust acoustic models. This paper presents our proposal to exploit reliable phonetic heuristics of pauses in speech to aid the detection of varying pauses. Based on it, a stepwise approach to optimize pause HMMs was applied to the data of SPINEII project, and achieved a more correct phonetic transcription. The cross-word triphone HMMs developed using this transcription got absolute 5.2% word error reduction when compared to the baseline model.
منابع مشابه
Modeling and Forecasting Iranian Inflation with Time Varying BVAR Models
This paper investigates the forecasting performance of different time-varying BVAR models for Iranian inflation. Forecast accuracy of a BVAR model with Litterman’s prior compared with a time-varying BVAR model (a version introduced by Doan et al., 1984); and a modified time-varying BVAR model, where the autoregressive coefficients are held constant and only the deterministic components are allo...
متن کاملSpeech starter: noise-robust endpoint detection by using filled pauses
In this paper we propose a speech interface function, called speech starter, that enables noise-robust endpoint (utterance) detection for speech recognition. When current speech recognizers are used in a noisy environment, a typical recognition error is caused by incorrect endpoints because their automatic detection is likely to be disturbed by non-stationary noises. The speech starter function...
متن کاملVariability and stability in collaborative dialogues: turn-taking and filled pauses
Filled pauses have important and varied functions in turntaking behavior, and better understanding of this relationship opens new ways for improving the quality and naturalness of dialogue systems. We use a corpus of collaborative task oriented dialogues to provide new insights into the relationship between filled pauses and turn-taking based on temporal and acoustic features. We then explore w...
متن کاملVariability and stability in collaborative dialogues: turn-taking and filled pauses
Filled pauses have important and varied functions in turntaking behavior, and better understanding of this relationship opens new ways for improving the quality and naturalness of dialogue systems. We use a corpus of collaborative task oriented dialogues to provide new insights into the relationship between filled pauses and turn-taking based on temporal and acoustic features. We then explore w...
متن کاملTowards recognizing "non-lexical" words in spontaneous conversational speech
The purpose of this paper is to study and analyze both the non-lexical lled pauses and intended responses in conversational spontaneous speech, and how this can be useful in both automatic speech recognition and speaker identi cation systems. Through experiments, it was found that we are able to distinguish between words and non-lexical words in spontaneous speech using prosodic features. Conse...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2002